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1. Introduction. In their interesting paper Davies and Gather draw our 
attention to what they call the "small print" in definitions of breakdown. 
Working from a formal group structure and a notion of equivariance, they 
show by a number of examples that a definition of breakdown may be void 
if not accompanied by a reasonable and precise group structure. This leads 
them to what we would label their key remark in Section 6: "We know of no 
situation not based on equivariance considerations where it can be shown 
that the highest breakdown point for a class of reasonable functionals is less 
than 1." 

Though we agree with their general point that one has to take care not 
to come up with void definitions, or put differently, to make the small print 
explicit, we want to draw attention to the relation of their results to an 
alternative definition of breakdown. In particular, we claim that a different 
perspective on the notion of breakdown may resolve some of the small print 
issues. 

The definition used by Davies and Gather in their equation (2.4) is a 
standard one and has its roots in the domain of location and scale estimation. 
As we argued in Genton and Lucas (2003), it is less useful in a setting with 
dependent data. For example, in a simple autoregression (AR) of order 1, 



the ordinary least squares (OLS) estimator for 9 is driven to zero by replacing 
one of the It's by an arbitrarily large number. Note that the OLS estimator 
thus tends to the center rather than the edge of the parameter space. Still, 
most people would agree that the estimator has lost its usefulness if only one 
extreme outlier is added. The reason is that the estimator no longer conveys 
useful information on the uncontaminated data. It is this latter notion that 
we want to put to the fore. 
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2. Breakdown point for (in)dependent observations. First, we would like 
to acknowledge that the breakdown definition as introduced in Genton and 
Lucas (2003) is subject to criticism raised by Davies and Gather (personal 
communication). One can easily construct an example of an estimator with 
breakdown point of 1 that would lose its information content on the uncon- 
taminated process upon the addition of only one outlier. This is mainly due 
to the lack of a limiting operation in our original definition. Therefore, for 
the sake of this comment we introduce the following slightly adapted and 
simplified version of the definition in Genton and Lucas (2003). 

Let Y denote a vector containing the sample of observations, and let y de- 
note the set of allowable samples. For example, in the asymptotic case Y 
might be a specific AR(1) process, while y is the set of all stationary 
AR(1) processes. In a finite sample, Y might be a specific vector in M n , 
while y is equal to W 1 . Let zi be an additive outlier process consisting of k 
outliers of magnitude £, such that we observe Y + rather than Y . To 
formalize the notion of information content on the uncont animated process, 
we introduce the concept of badness set, which in this case we define as 



where #(•) denotes the Fisher consistent estimator functional. Let /i denote 
an appropriate measure for the badness set. In most cases the Lebesgue 
measure suffices. Then we define the breakdown point of an estimator as 



An extension to the asymptotic case is straightforward. To see how the 
definition works, consider the regression example in Section 6 of Davies 
and Gather. We have y = R nx2 and R*(0,y) = [— n,n]. The estimator is 
given by 9(Y) = max(-n, min(ra, 9 OLS (Y))), with 6 OLS (Y) the standard OLS 
estimator. We set /i to the standard Lebesgue measure. By taking k = 1 
and letting the size of the outlier (£) diverge, the intersection of the two 
badness sets in the definition becomes {n} or {— n}, which is a singleton 
with Lebesgue measure zero. Therefore, the estimator has broken according 
to our new definition. This appears reasonable as the estimator no longer 
conveys information about possibly uncontaminated samples. 



(2) 




bdp = — min< k — 1 for all compact y C 3^ : 
n I 



(3) 




3. Time series. The advantages of a different perspective on breakdown 
become even more apparent in the time series setting. Again consider our 
AR(1) example from (1). In the asymptotic case, define the i.i.d. additive 
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outlier process Z^ t with P[Zp t = £] = P[^»t = — C] = p/2, and Z^ t = oth- 
erwise. Figure 1 presents plots of the badness set R*(Z^,y) associated with 
three estimators of 6 as a function of £ for p = 5%, 25%, 50%. Here y is 
the set of all stationary AR(1) processes; see the comment in the discussion 
below. We set fi to the standard Lebesgue measure. 

The first estimator is the OLS estimator which in the above setting yields 
the badness set (2) based on the explicit expression 

Letting the size of the outliers (£) diverge, we see that unless p = 0, the 
estimator #ols tends to zero and the corresponding badness set becomes {0}; 
see the first row of Figure 1. Therefore, the asymptotic breakdown point of 
the OLS estimator for the AR(1) parameter 9 is in the setting described 
above. 

The second estimator is the least median of squares (LMS) estimator 
of 6. It yields a badness set (2) based on the expression 6*lms(^ + ^p) = 
argming e j_ 1 1 j c under the constraint 




Fig. 1. Plots of the badness set R*(Zp,y) associated with three estimators of 9 in the 
AR(1) as a function of(forp=5%, 25%, 50%: OLS (top); LMS (middle); DR (bottom). 



4 



BREAKDOWN AND GROUPS 



where r 2 = 1 + (6 — 9) 2 / (1 — 9 2 ) and x 2 ( x ] denotes the cumulative distri- 
bution function evaluated at x of a chi-square random variable with noncen- 
trality parameter 5 2 . The second row of Figure 1 indicates that the badness 
set for p = 5% still takes a continuum of values, whereas it tends to the set 
{—1,0, +1} for p = 25%. For p = 50%, the badness set collapses to {0} as £ 
diverges. Therefore, letting the size of the outliers (£) diverge, the asymp- 
totic breakdown point of the LMS estimator for the AR(1) parameter 9 can 
be computed from (5) to be 22.1% in the setting described above. 

The third estimator is the deepest regression (DR) estimator of 9 defined 
by median^lj/Yt-i)- Under the additive outlier process described above, 
we need to consider the distribution of (Yj + Z^ t ) / (Yt-i + ^5t_i)- It yields 
a badness set (2) based on the expression 9br{Y + Z^) given by the value c 
satisfying 

i = (l-p) 2 G(c;0,0) 
(6) + [G(c; C, 0) + G(c; 0, C) + G(c; -(, 0) + G(c; 0, -()} 

9 

+ V - [G(c; C, C) + G(c; (,-() + G(c; C) + G(c; -£ -()} , 

where G(x; a, b) is the cumulative distribution function evaluated at x of 
the ratio of two correlated normal random variables with means a and b, 
variances 1/(1 - 9 2 ) and correlation 9 [see Hinkley (1969)]. The third row of 
Figure 1 indicates that the badness set still takes a continuum of values for 
p = 5% and p = 25%, whereas it collapses to {0} for p = 50% as ( diverges. 
Thus, the asymptotic breakdown point of the DR estimator for the AR(1) 
parameter 9 can be computed from (6) to be 50% in the setting described 
above. 

It is interesting to note that the breakdown points of the LMS and DR 
estimators are markedly different for the AR(1) process above, whereas they 
are the same (50%) in the setting of simple regression. This indicates that 
our definition of breakdown allows us to distinguish between various robust 
estimators in the time series setting. 

4. Discussion. The definition in (3) appears less dependent on a group 
structure than the definition used by Davies and Gather. Of course, also 
the definition in (3) has its limitations. For example, the definition cannot 
be used if one wants to assess the breakdown of an estimator at a specific 
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sample, that is, if y is a singleton. The main drawback of conditioning on 
the sample is that one has to be very explicit about the region toward which 
the estimator breaks down, for example, to the edge of the parameter space. 
This may not be trivial for dependent data, as was shown in the AR(1) ex- 
ample for LMS. Moreover, conditioning the breakdown behavior on a specific 
sample may relate more to properties of the sample rather than of the esti- 
mator. The breakdown notion in (3) based on information revelation about 
the possible uncontaminated samples resolves this issue. That notion, how- 
ever, can most easily be operationalized if there is a continuum of possible 
samples, which suffices for most cases studied in the literature. 

A second possible limitation of (3) is that the user has to be explicit 
about the set y of possible samples (or processes) Y. For example, if we 
consider stationary AR(1) processes in the asymptotic setting, the (asymp- 
totic) breakdown point of the OLS estimator is 0. If, however, we consider 
AR(1) processes characterized by 9 € [—1, 1], the breakdown point is 1: the 
OLS estimator retains information about the distinction between stationary 
processes and processes with 9 arbitrarily close to 1. In that sense the esti- 
mator does not break down, while it has broken down if one only wants to 
distinguish between alternative stationary processes; see Figure 1. 

Finally, the definition in (3) is not very explicit about the measure [i. 
As mentioned, the Lebesgue measure suffices in most cases of practical in- 
terest. Despite the fact that empirical data have finite precision, one can 
work under the assumption that Y lies in a continuum to derive the break- 
down properties of the estimator. The properties derived are usually also 
relevant for a setting with finite precision data. We do not exclude, how- 
ever, that examples can be constructed where the Lebesgue measure is in- 
appropriate. For example, the parameter space may be discrete and finite. 
In such cases, alternative measures fi must be used. Additionally, the re- 
striction that the measure of the intersection of badness sets is zero may 
have to be replaced by something more complicated, like an infimum of 
M zCf i(R*(zly')nR*(0,y')) over k. 

k 

The ideas and cautionary remarks in the paper of Davies and Gather 
are important and relevant. Effectively, they promote that breakdown is 
only a useful notion for "sensible" estimators and argue that equivariance 
is the crucial notion here. We argued that they mainly build on a restricted 
notion of breakdown. The focus of future research should be put on de- 
veloping alternative definitions of breakdown that are less susceptible to 
the criticisms raised by Davies and Gather. The definition in (3) is such 
an attempt and tries to formalize the phenomena illustrated in Figure 1. 
In finite samples it is still susceptible to counterexamples, for example, 
9(Y) = max(— n, min(ra, 9 OLS (Y))) + 2(frac(Yi) — l)/n, where frac(x) denotes 
the fractional part of x, for Davies and Gather's example in Section ??, but 
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the examples become increasingly contrived. Moreover, in the asymptotic 
setting the small print issue appears to become even smaller, especially if 
we limit ourselves to estimators that are consistent and satisfy some form 
of continuity in the observations. Further developments along these lines 
appear promising. 



REFERENCES 

Genton, M. G. and Lucas, A. (2003). Comprehensive definitions of breakdown-points 
for independent and dependent observations. J. R. Stat. Soc. Ser. B Stat. Methodol. 65 
81-94. MR1959094 

Hinkley, D. V. (1969). On the ratio of two correlated normal random variables. 
Biometrika 56 635-639. MR254946 



Department of Statistics 
Texas A&M University 
College Station, Texas 77843-3143 
USA 

E-MAIL: genton@stat.tamu.edu 



Department of Finance 
ECO/FIN, Vrije Universiteit 
De Boelelaan 1105 
1081HV Amsterdam 
The Netherlands 
E-MAIL: alucasOfeweb. vu.nl 



